AD-A274  281 

iiiiiiiii 

UNIVERSITY  OF  CALIFORNIA,  SAN  DIEGO 


BERKELEY  •  DAVIS  •  IRVINE  •  LOS  ANGELES  •  RIVERSIDE  •  SAN  DIEGO  •  SAN  FRANCISCO 


DEPARTMENT  OF  ELECTRICAL  AND  COMPUTER  ENGINEERING  9500  GILMAN  DRIVE 

LA  JOLLA,  CALIFORNIA.  92093-0407 

December  3,  1993 


Re:  Semi-annual  report  on  ONR  grant  no:  N000 14-93- 1-0698 

Title:  Design  and  Evaluation  of  Fault  Tolerance  in  Opto-electronic  Computing 

Principle  Investigator:  Professor  Ting-Ting  Y.  Lin 

Addressees: 

Scientific  Officer,  Dr.  Clifford  G.  Lau 
Administrative  Grants  Officer 
Director,  Naval  Research  Laboratory 
Defense  Technical  Information  Center 

Dear  Sirs, 


DT1C 

ELECTE 
JAN  03  1994 

A 


tj 

Si 

a  B 

a.  2 
a  . 

ci  ® 

5  *’tj  i 

jjj7>  ~  I 

rj  •'  3 1 


.u 

‘i 

i } 

n 

fl 

;» 

n 

8 

33 

TJ 

'J 

a 

This  letter  report,  for  the  period  of  1  June  1993  through  1  December  1993,  describes  the 
activities  supported  by  the  ASSERT  grant.  A  student,  John  Lillis,  is  supported  under  this 
grant.  John  has  a  strong  background  in  theory  and  will  perform  some  analytical  work  on 
clustering  and  partitioning  (for  our  test  pipe  formation)  to  minimize  the  addition  of  test 
circuits  and  delay.  This  is  especially  useful  in  constructing  our  processing  plane.  We  started 
with  two  areas. 

The  first  area  of  research  is  Data-Flow  Clustering.  In  synchronous  circuits  (described  in 
terms  of  registers  and  combinational  nodes)  it  is  known  that  the  feedback  loops  are  dominant 
in  determining  the  achievable  clock  period  for  the  circuit.  More  precisely,  the  maximum 
delay  to  register  ratio  over  all  loops  in  the  circuit  (called  the  iteration  bound)  is  a  lower 
bound  on  the  clock  period.  Furthermore,  if  the  combinational  nodes  are  fine-grained  in 
nature,  the  iteration  bound  very  closely  reflects  the  achievable  clock  period  through  retiming. 
However,  because  of  size  constraints  it  is  often  necessary  to  partition  such  a  circuit  into 
multiple  modules  (for  example  FPGAs).  In  such  a  situation,  inter-module  delay  can  be 
substantial  compared  to  intra-module  delay.  Accordingly,  we  study  the  problem  of  clustering 
the  nodes  of  the  graph  into  different  modules  such  that  the  iteration  bound  of  the  resulting 
circuit  (where  inter-module  delays  are  taken  into  account  when  determining  the  delay- 
register  ratio  of  a  loop)  is  minimized.  We  have  shown  that  this  problem  is  NP-Complete 
even  if  replication  of  combinational  nodes  is  allowed  (ie,  provided  all  its  inputs  are  available, 
a  node  may  appear  in  two  different  modules  so  as  to  absorb  some  of  the  inter-module  delay 
costs  of  different  loops;  this  is  a  popular  method  for  minimizing  delay  in  acyclic  networks). 
Accordingly,  we  have  proposed  a  heuristic  solution  to  the  problem  and  are  conducting 
experiments  to  evaluate  its  effectiveness. 
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The  second  area  of  research  is  network  partitioning.  We  study  the  classical  problem  of 
finding  the  Minimum-Cut  in  an  undirected  graph,  given  source  and  sink  nodes,  s  and  t.  We 
reformulate  the  problem  as  a  continuous  placement  problem.  We  show  the  correspondence 
between  the  Min-Cut  problem  and  the  placement  problem  which  is  then  solved  iteratively 
using  gradient  methods.  In  addition,  this  method  has  good  potential  for  efficient  parallel 
implementation.  We  provide  experimental  data  on  the  effectiveness  of  our  approach.  This 
work  will  be  submitted  to  Information  Processing  Letters.  A  draft  of  the  paper  is  in 
Appendix  A. 

This  summarizes  our  research  progress.  Please  do  not  hesitate  to  call  me  if  this  is  not 
sufficient. 
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Appendix  A 


A  Gradient  Approach  To  The  Minimum  Cut  Problem 

Chung-Kuan  Cheng,  John  Lillis  and  Ting-Ting  Lin 
University  of  California,  San  Diego 
La  Jolla.  CA  92093 

Abstract 

We  present  a  gradient  approach  to  the  classical  Min-Cut  problem  for  capacitated  undirected 
graphs.  This  is  done  by  showing  the  correspondence  between  a  continuous  minimum  placement 
problem  and  the  min-cut  problem.  We  demonstrate  the  use  of  the  Successive  Over  Relaxation 
( SOR )  iterative  method  for  solving  the  continuous  formulation.  The  feasibility  of  t his  approach 
is  demonstrated  experimentally.  We  believe  this  method  has  potential  for  successful  parallel 
implementation 

1  Introduction 

We  present  a  gradient  approach  to  the  problem  of  finding  the  minimum  cut  in  a  weighted, 
undirected  graph  G  =  (V,  E)  given  source  and  sink  nodes  s  and  t ,  and  edge  capacities  cK]  for 
(i.j)  G  E  ( aj  is  considered  to  be  0  if  (i.j)  £  E).  Our  approach  is  inspired  by  the  Gordian-L 
placement  tool  [1],  The  paper  is  organized  as  follows: 

•  We  will  show  that  we  can  find  the  min-cut  in  a  graph  by  minimizing  the  function  F(r)  = 
53(i ,j)eE  ci,j\xi~xj  \  where  each  node  i  in  the  graph  has  a  real- valued  variable  .r,  associated 
with  it  except  xs  and  xt  which  are  fixed. 


1 


•  We  give  an  approximation  to  the  above  formulation  by  modeling  it  as  a  nonlinear  resistive 
network. 

•  We  show  how  to  minimize  the  above  approximation  by  way  of  t he  Successive  Over- 
Relaxation  ( SOR )  method  to  take  advantage  of  the  possible  sparseness  of  the  graph  and 
the  properties  of  the  admitance  matrix  derived  from  the  above  approximation. 

•  Last  we  give  experimental  results  of  this  approach. 

2  Problem  Formulation 

Let  G  —  (V.E)  be  a  graph  where  V  is  the  set  of  vertices  and  E  is  the  set  of  undirected  arcs. 
Further,  for  each  arc  (i.j)  £  E  let  c,-.j  be  the  capacity  of  that  arc. 

A  minimum  cut  problem  is,  given  .V  =  {V.  E)  and  two  vertices  .s .1  6  L.  to  partition  V  into 
disjoint  sets  i's  and  l'(  where  s  €  lj  and  t  €  L<  such  that 

C(YSA\)=  £  (1) 

i€''j  jet  t 

is  minimized.  Ie,  the  total  capacity  of  the  edges  between  the  partitions  is  minimum. 

Let  f(xj.Xj)  =  Cjjl.r,  -  .r j | .  Now  consider  the  continuous  minimimum  placement  problem 
which  minimizes  the  objective  function 

f(f)=  E  /(*«■  *i)  (2) 

With  xs  and  xt  fixed,  we  claim  the  optimal  solution  to  the  above  minimization  problem  captures 
the  minimum  cuts  of  the  graph.  In  the  following,  we  assume  that  xs  —  —  ^  and  ,r(  = 

Lemma  2.1  If  x  —  xj. . . . ,  x„  yields  the  minimum  for  expression  2.  any  rn  (x s  <  w  <  ,r(  and 
m  not  equal  any  xx)  partitions  V  into  Uro  sets  \j„(  =  {?j.r,  <  in)  and  =  {ijr,  >  m}  and 


this  partition  is  a  minimum  cut. 


Figure  1:  Illustration  of  reformulatic  t.  Each  segment  i  defines  a  cut  between  nodes  on  the  left 
of  the  segment  and  the  nodes  on  the  left.  The  value  of  this  cut  is  t, 

Proof:  Let  /m)n  be  the  minimum  cut  value  for  graph  C.  Let  p  +  1  be  the  number  of  distinct 
values  among  the  .r,‘s  and  let  a',.-  •  -a:'  ,  be  these  distinct  values  in  ascending  order  (ie.  there 
are  p  "segments"  between  .rs  and  .r().  We  define  1,  =  Ylrv<r'  Ei,  >r'  ,  ru.<-  'n  oilier  words.  /, 
is  the  total  capacity  of  the  edges  crossing  the  fth  segment.  We  also  define  d,  as  ihe  leiiglh 
of  the  /‘th  segment  (,r'+)  -  a-').  This  formulation  is  illustrated  in  the  figure  ].  Notice  that 
tl%  =  ./•(  -  .rs.  Clearly  none  of  the  t,'s  can  be  less  than  as  this  would  define  a  cheaper 
cut.  Therefore,  we  have 

v 

F(T)  =  ^  fdU  >  Cmi >,(•*'(  -  -Vs)  (•'*) 

i=l 

Thus,  in  order  to  minimize  F(x).  we  need  /,  =  C„,,„  Vi  which  correspond  to  a  cut. 

3  Approximation  By  A  Nonlinear  Resistive  Network 

We  transform  our  placement  problem  into  a  nonlinear  resistive  network.  Let  .r,  bo  ihe  voltage 
of  node  i.  Given  a  constant  c.  for  each  edge  (i.j).  we  construct  a  nonlinear  resistor  connecting 
nodes  i  and  j  with  a  conductance.  <jKj.  defined  as  follows: 


r,.j/k.  ~  -Tjl  if  l-r,  ~  >  < 

< 

fi.jA  if  l-r,  -  ij\  <  i 


(->) 


Figure  2  graphically  depicts  the  conductance  o. 

The  current  flowing  from  node  i  to  node  j  is  equal  to  the  product  of  the  voltage  difference 
| j,  -  j-j|  and  the  conductance  a,  Jk  i.e. 


( r,  -  .r  j  )cr,  j 


'-j 


if  |.r,  -  Xj  |  >  ( 


(5) 


[  r,.j(x,  -  Xj)/<  if  |.r,  -  .r,|  <  e 

where  F.j  =  1  if  .r,  >  x, :  =  -1  if  .r,  <  ./ liquation  (a)  is  an  approximation  of  the 

partial  derivative  of  our  objective  function  F  with  respect  to  x,.  Consequently.  Kirchholf's 
current  law  [3]  in  the  nonlinear  resistive  network  corresponds  to  the  necessary  condition  of  a 
zero  gradient.  We  claim  the  analogy  between  our  problem  and  the  above  resistive  network. 


Theorem  3.1  (liven  a  constant  T.  ther  exists  an  e  =  —  for  tlic  conductance  equation 

(4)  such  that  the.  coltaeje  solution  of  the  trnsfonne  el  nonlinear  n  sistive  network  is  an  uppreixunette 
solution  of  F  with  ein  error  bound  T  on  the  value  of  /  . 


Proof:  To  prove  the  theorem,  we  introduce  the  function. 


h[x)  =  { 


2j.r|  -  <  if  |.ij  >  < 
x2 /(  if  ).r |  <  e 

and  a  potential  function  approximating  the  objective  function  /- 


*(•?)  =  Yi  ■■r.,  1 

(>o) 


where 


W(X,.Xj)  -  -C,xJll(x,  -  Xj  ) 


( b  1 


(7) 


I 


Figure  2:  Illustration  of  c(.r,.x,)  mid  o(.r,..rl).  Here.  e  =  ~2  mid  r,,  —  1. 


Figure  2  graphically  depicts  c- 

We  prove  the  theorem  by  proving  the  three  following  lemmas  (proofs  appear  in  the  ap 
pendix): 

Lemma  3.1  The  solution  of  tin  nonlinear  re  si  stive  network  ('>)  eh  rices  the  minimum  'l'(.r). 

Lemma  3.2  (Him  a  positive  number  *.  |/  (.r)  —  'f'(.r)|  <  7  holds  if  <  =  jj-ym— ^ - .  where  | h 

is  the  number  of  edges  in  the  ejreiph. 

Lemma  3.3  The  inee/imlity  |  V[.r)  -  'Ff.r))  <  7  holds  if  <  <  ^^uaj-  c~~  ■ 

Therefore  from  the  above  three  lemmas,  we  have  proved  Theorem  3.1. 


0.  Set  e  =  e-init 

1.  Set  initial  x  vector 

2.  Calculate  otJ  according  to  current  x  vector 

3.  Solve  G ii.r i  =  -G12X2  as  a  linear  resistive  network, 
j.  If  Change  in  Objective  Function  is  less  than  A 

e  —  e  ■  e-factor 

■5.  Repeat  steps  2.  3  and  f  until  convergence 

Figure  3:  Iterative  algorithm  for  solving  the  continuous  placement  problem.  Step  3  is  performed 
by  the  SOR  iterative  method.  In  our  experiments.  A  is  a  function  of  e  and  the  current  objective 
'I'.  See  Experimental  Results  section  for  details. 

4  Piecewise  Linear  Algorithm 

We  propose  to  solve  our  resistive  network  by  starting  at.  an  intitial  vector  x.  Sex t.  we  treat  t lie 
circuit  as  a  linear  resistive  network,  and  find  the  solution  with  the  Successive  Over  Relaxation 
(SOR)  method  [2]  We  then  update  the  current  solution  x  and  repeat  the  process  until  the 
solution  converges  (see  figure  3).  In  the  figure,  ,rj  represents  the  variable  node  positions  and 
X2  denotes  the  fixed  node  positions  (,r5  and  xt).  <7n  and  G\i  denote  the  submatrices  of 
admittance  matrix  derived  from  o  corresponding  to  vectors  .rj  and  x-i.  Hence,  the  problem 
reduces  to  solving  Gn-ri  =  -G’12^2- 


In  addition  to  <r  (1).  we  introduce  the  following  function: 

l/(.r|  if  jx|  >  c 

9(x)  =  { 


(«) 


1  /r  if  |.r |  <  c 

In  the  following,  let  x  and  x  be  the  vector  at  the  A-th  and  A-  +  1st  iterations  respectively.  In 


(i 


addition,  for  notalional  convenience,  let 


Xij  =  fi~Xj  (9) 

1  j  —  xt  —  x  j  (19) 

We  introduce 

P{x.x)  =  Yi  ci}g{x~,j)x~tj2  ( 11 ) 

(>-j) 

p{x.  x)  =  Y  (i2) 

(..J) 

Theorem  4.1  For  each  iteration  of  steps  2,3  and  4,  the  potential  function  'p(.r)  is  strictly 
decreasing. 

We  prove  'P(.r)  >  ^(.f )  by  proving  the  two  following  lemmas  (proofs  appear  in  the  appendix). 
Lemma  4.1  Before  the  solution  converges.  P(x,x)  >  P(x.x) 

Lemma  4.2  Before  the  solution  converges. 

2¥(i)  -  P(x.x)  >  21»(.r)  -  P(x.x)  (13) 


We  conclude  the  proof  of  the  Theorem  4.1  from  the  preceding  lemmas. 

5  Experimental  Results 

In  this  section,  we  present  experimental  results  of  our  approach.  Test  graphs  were  gotten  from 
the  Washington. c  (developed  by  Richard  Anderson  and  students  at  the  University  of  Wash¬ 
ington)  program  at  DIM  ACS.  It  generates  a  variety  of  graphs  for  the  directed  version  of  the 
min-cut  problem.  We  use  a  post  process  to  convert  these  graphs  into  undirected  graphs. 

In  our  experiments,  we  fixed  x,  =  -100.0,  xt  =  100.0,  initial  c  =  200.0.  c-factor=  0.80  and 
A  =  F(.r)  ■  e  ■  10-3.  In  addition,  we  set  u,  the  relaxation  parameter  of  SOP  to  be  1.9. 
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|V| 

in 

SOU  Executions 

Total  Iterations 

CPU  time 

102 

590 

314 

8177 

5.6 

227 

1310 

423 

11538 

18.7 

-101 

23S0 

656 

12436 

40.6 

Table  1:  Experimental  results 

6  Concluding  Remarks 

We  have  demonstrated  the  theoretical  foundations  and  experimental  feasibility  of  a  gradient 
approach  to  the  Min-Cut  problem.  Future  work  may  include  detailed  analysis  of  t  he  convergence 
rate,  more  exhaustive  evaluation  of  algorithmic  parameters  and  parallel  implementation. 
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7  Appendix 

We  present  proofs  of  lemmas  stated  in  the  body  of  the  paper. 

Lemma  3.1  l he  solution  of  the  nonlinear  resistive  network ;  (r>)  derives  the  minimum  'J'(.r). 
Proof:  1  he  lemma  follows  the  reference  [.}].  pp.  77b.  Since  equation  (b)  is  continuous  in  i  1  s 
first  derivative,  its  gradient  is  equal  to  zero  at  a  minimum  solulion.  We  can  derive  that  the 


s 


KCL  equation  for  the  nonlinear  resistive  network  is  equivalent  to  the  necessary  condition  that 
the  gradient  of  (6)  is  zero,  which  proves  the  lemma. 


Lemma  3.2  Given  a  positive  number  7 .  |/-'(.r)  -  vk(.r)|  <  7  holds  if  e  -  |/.;|77iUi,  ;l~, ;  •  where  \k\ 
is  the  number  of  edges  in  the  graph. 

Proof:  Consider  the  following  two  cases: 

(i)  |.r,  -  Xj\  >  c 

From  equations  (2)  and  (G).  we  know  that 

I /(  r , .  Sj )  -  d.c,..r,  )|  =  c,.//'!  (11) 


(ii)  |.r,  -  Xj\  <  c 


Therefore,  given  c  =  we  have 

\F[r)  -  'P(.r))  <  Y,  -  *j)~  <  ‘  0r») 

(i.j) 

Lemma  3.3  The  inequality  |/'(.r)  -  'F(.r  )|  <  7  holds  if  <  <  ijrpjpfjj——  ■ 

Proof:  In  the  following,  let  .r  and  .r  be  the  respective  global  minima  of  /•(./  )  and  'J'(.r).  l.e. 


F(.r)  <  T(.r)  V.r  ( 1 1  ) 

'i’(.r)  <  'l’(.r)  V.r  (Is) 

We  can  derive 

F(.r)  -  'f'(i-)  =  /•'(./')-  /'(.I')  +  F'(.r)  -  *(i)  (  H») 

Since  F{.r)  -  F(.r)  <  0  by  definition,  from  the  previous  lemma,  we  have 


F(.r)  -  'l'(.r)  <  /'(.'  )  “  F(.r)  +  7  <  7 


(20) 


Similarly,  we  can  derive 


F(x)  -  1<(.r)  =  F(x)  -  'J'(.r)  +  'l'(.r)  -  #(*)  >  -7.  (21  ) 

Thus  from  inequalities  (20)  and  (21).  we  conclude  that 

|T(.r)  —  'l'(.r)!  <  7  (22) 

Lemma  4.1  Before  Ihe  solution  eonveiyt s.  /’(.  r..r)  >  l'(x.x) 

Proof:  We  use  the  properly  of  the  linear  resistive  network  [.’I] .  pp.  770.  that  the  iniiiimum 
power  disipation  of  the  network  occurs  at  the  only  solution  of  the  network  equation'.  1  hen  hue. 
before  the  solution  converges,  the  solution  of  f/nj-j  =  —  f/j always  derives  the  v« iltaue  that 
reduces  the  power  dissipation  of  the  linear  resistive  network,  i.e.  /’|.c..m  >  Pix.i  \ 

Lemma  4.2  Before  the  solution  com  e  ryes. 

2'l'(.r)  -  / ’( x .  r)  >  2'lM.r)  -  /’(.r. ./'•)  ■  2  <  1 

Proof:  We  can  derive  that 

2 'f'(.t')  -  P(x.x)  =  ^2  -  c, ,<•/(. ^  e,  [/,(./■.  )  -  e/(x:  ,.r  >  21  • 

<>'•  j) 

24'(t)  -  P(x.x)  =  V  c,Jh(.flJ)  -  (-,,()  (.r],  J.r),-’  =  Y~  e, ,[/,(./•,, )  -  </(./'  o'.  .  J'  1 

(’■j)  (•..,) 

We  now  show  that  in  the  following  four  cases. 

hi*,.,)  ~  >  h(F,j)  -  <j{.r,j).rIJ1  Vi.j  t -fi . 

case  (i)  |.r,,|  <  c.  |.rIJ|  <  e 

x  2  1 

lii-r,.,)  -  !){  )xfj  =  '-P - —  .r  r>  2  =  0  (27  1 

r~  2  1 

>1  (  P , )  -  y(  x , ,  )x  f  =  — - T,  j  *  =  0  ( 1. 2* ) 

(  ( 


10 


' 

case  (ii)  |x,J  <  c.  |x'u  >  < 

r  *  1 

h(rij)  -  gU.jUlj  =  -■c<ji  =  0 

(2‘M 

h{x,j)  -  y(xij)xtj-  -  2|x,_,|  -  <  -  -.r,j  -  <  <  0 

(•{()) 

case  (iii)  |x7,|  >  c,  \-r,}  <  ( 

h(*ij)  -  -  2kol  ~ (  |r-  |J<7  -  l-r'./l  f 

(di) 

(d'2) 

case  (iv)  |:(rtJ|  >  f.  \.itJ\  >  < 

=  k»J  - c 

tddi 

i  .  , 

h(*ij)  ~  ffWijtfj"  =  2kol  -s-S 

i  h  n 

Since 

■irv2  >  •2|.r',||.r'J|  -  r'j 2 

t  d'> ) 

we  liave 
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We  conclude  from  cases  (i)  -  (iv)  that 
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